
SYSTEM, METHOD, AND COMPUTER PROGRAM PRODUCT FOR 
REFORMATTING NON-XML DATA FOR USE WITH INTERNET BASED SYSTEMS 



1. Technical Field 

The present invention relates generally to computer database software and more 
specifically to a parameterized database system query. 

2. Background Of The Invention 

Many companies store data on database systems such as the present assignee's DB2 
relational database system. DB2 is a registered trademark of International Business Machines 
Corporation in the United States, other countries, or both. The DB2 system is a relational 
database system that facilitates fast and efficient data extraction using queries written in a 
language known as Structured Query Language (SQL). As a simple example, using an SQL 
statement a relational database system can be used to search for all employees in the database 
that have salaries in excess of $50,000 and that are in the engineering department of the 
company. The portions of the statement that specify the salary and department search criteria 
are referred to as "parameterized" portions of the statement. 

It is increasingly the case that companies wish to permit certain customers (or indeed 
the public at large) to access a company's database. This is possible owing to the Internet, 
because a database can be made accessible via the Internet. As an example of when a 
company might want to allow access, a bank might want to allow its customers to access 
stock quotes that are frequently updated and available in a database. This can be thought of 
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as a database query. As another example, a library might wish to allow book borrowers to 
enter the library database and change their address when they move, without requiring 
interfacing with library personnel. This can be thought of as an update. 

Prior systems and methods for allowing selected outside entry into a Web-accessible 
internal database suffer from several drawbacks. Some of these previous systems, such as the 
Common Object Request Broker Architecture (CORBA), were initially designed for internal 
data transfer, and accordingly did not use Hypertext Transfer Protocol (HTTP). One 
consequence of this is that since non-HTTP data cannot pass through computer firewalls, 
systems like CORBA are ineffective for a great many customers whose computers reside 
behind corporate or Internet Service Provider (ISP) firewalls. Other systems and methods 
require specific tailoring for each database system sought to be accessed, which requires the 
access system to be inflexibly bound to a particular database management system (DBMS) 
language. 

SUMMARY OF THE INVENTION 

Having recognized the above drawbacks, the preferred embodiment of the present 
invention provides the solutions noted below to one or more of them. 

A method for permitting a client to access a database system on a server via an 
Internet connection includes providing middleware communicating with the client and server. 
The middleware includes a file that has a parameterized database system query language 
statement. The file is accessed in response to a client request for data. Using the 
parameterized statement, data is returned that satisfies the request to the client. 



Preferably, the database system query language is SQL. Moreover, the data that 
satisfies the request is sent via HTTP in Extensible Markup Language (XML). In a preferred 
embodiment, the parameterized statement facilitates a query for data. On the other hand, the 
parameterized statement facilitates an update to the database system or facilitates an SQL 
stored procedure call. 

In a preferred embodiment, the client request is a Simple Object Access Protocol 
(SOAP) request, an HTTP GET request, or an HTTP POST request. Preferably, the 
middleware includes a router that receives the client request. The router includes a servlet 
that replaces some parameters in the parameterized statement with corresponding values from 
the client request to establish a fully qualified SQL statement. The servlet sends the SQL 
statement to the database system for' execution thereof. 

Preferably, the database system generates a response to the SQL statement, and if 
necessary, the servlet converts the response to XML. In a preferred embodiment, the servlet 
generates documentation and a test page in Hypertext Markup Language (HTML) based on 
the client request. The servlet also generates a service description (Web Services Description 
Language (WSDL) file) based on the parameterized SQL statement. Moreover, if necessary, 
the servlet incorporates a document access definition (DAD) file into the client request. The 
servlet can also generate Extensible Markup Language Schema data (XSD) based on the client 
request. 

In another aspect of the present invention, a computer system includes a client and 
middleware that is accessible to the client. The system also includes a file that is accessible 



to the middleware. In this aspect, the file contains a parameterized statement. The computer 
system also includes a database system that is accessible to the middleware. 

In yet another aspect of the present invention, a method for permitting a client to 
access a database system on a server via a wide area network includes providing a file 
between the client and database system. The file has at least one parameterized SQL 
statement. In this aspect, the parameters in the parameterized statement are replaced with 
corresponding values from the client request to establish a fully qualified SQL statement. The 
SQL statement is sent to the database system where it is executed. A response to the SQL 
statement is generated at the database system, and then sent to the client. If necessary, the 
response is converted to XML before it is sent to the client. 

In still another aspect of the present invention, a computer program device includes a 
computer readable means having logic means for facilitating data communication between a 
client and a database at a server. The computer readable means includes logic means for 
receiving a client request for data. Moreover, the computer readable means includes logic 
means for accessing a file having a parameterized SQL statement based on the client request. 
In this aspect, the computer readable means also includes logic means for returning a response 
to the client in XML over HTTP based on the parameterized SQL statement. 

The present invention will now be described, by way of example, with reference to the 
accompanying drawings, in which: 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a block diagram of a system architecture; 



Figure 2 is a flow chart of the processing logic of the preferred embodiment of the 
present invention; 

Figure 3 is an example of a specific DADX file; and 
Figure 4 is the general schema for a DADX file. 

DESCRIPTION OF AN EMBODIMENT OF THE INVENTION 

Referring initially to Figure 1, a system is shown and generally designated 10. As 
shown in Figure 1, the system includes at least one client computer 12 connected to a router 
14. Preferably, the connection between the client computer 12 and the router 14 is 
established via the Internet, but other connections can be used. It can be appreciated that the 
client computer 12 connects to the Internet via telephone modem, cable modem, local-area 
network (LAN), wide-area network (WAN), Tl or any other means well known in the art. 
Figure 1 shows that the router 14 includes a DB2 XML extender administrative servlet (DXX 
admin servlet) 16. The DXX admin servlet 16 configures at least one DXX invoker 18, 
which in turn can access at least one document access definition extension (DADX) file 20. 
The DADX file 20, in turn, can access one or more DAD files 22. 

DXX is a collection of stored procedures, user defined types (UDTs), and user defined 
functions (UDFs) that are typically used in programs written in SQL, Java, C++, and other 
languages. The DXX invoker 18 handles universal resource locators (URLs) having the 
extensions dadx" that reference web services and "dtd" that reference document type 
definition (DTD) documents. Specifically, the DXX invoker 18 provides runtime support for 
invoking DADX documents as web services written, e.g., in Apache Simple Object Access 



Protocol (SOAP). As shown, the servlet 16 of the router 14 connects to a database 24 at 
invocation times. Preferably, this connection is established by a Java database connection 
(JDBC). 

In a preferred embodiment, the DXX Invoker 18, a Java component, interfaces with 
Apache SOAP 2.1 runtime using a pluggable provider support. A web application developer 
creates an instance of the DXX Invoker 18 for each database 24 that is to be accessed. Each 
instance of the DXX Invoker 1 8 is associated with a database connection and a set of DADX 
files. The DXX Admin servlet 16 is provided to simplify the task of creating instances of the 
DXX Invoker 18. Specifically, the DXX Admin servlet 16 provides a web user interface for 
configuring instances of the DXX Invoker 18. Someone, e.g., a database administrator, sets 
up the databases and enables them for use by DXX. The service provider creates DAD and 
DADX documents and deploys them to the web application. Each DADX document is 
associated with a URL that identifies a specific web service. 

The logic of the present invention may be contained on a data storage device with a 
computer readable medium, such as a computer diskette. Or, the instructions may be stored 
on a magnetic tape, conventional hard disk drive, electronic read-only memory, optical storage 
device, or other appropriate data storage device or transmitting device thereby making a 
computer program product, i.e., an article of manufacture according to the invention. In an 
illustrative embodiment of the invention, the computer-executable instructions may be lines of 
Java code. 

The flow charts herein illustrate the structure of the logic of the present invention as 
embodied in computer program software. Those skilled in the art will appreciate that the 
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flow charts illustrate the structures of computer program code elements including logic circuits 
. on an integrated circuit, that function according to this invention. Manifestly, the invention is 
practiced in its essential embodiment by a machine component that renders the program 
elements in a form that instructs a digital processing apparatus (that is, a computer) to 
5 perform a sequence of function steps corresponding to those shown. 

It is to be understood that throughout the remainder of this document, any reference to 
"client" includes the client computer 12. Any reference to "middleware" includes the portion 
of the system 10 including the router 14, the DXX admin servlet 16, the DXX invoker 18, the 
DADX document 20, and the DAD document 22. Also, any reference to "server" includes 
DO the database 24. 

Referring to Figure 2, the processor operational logic of the present invention is shown 
and begins at state 30. At block 32, an HTTP service request is received. For example, the 
service request can be a client price request for a particular product - the price of the product 
being contained in a database of prices at the server. The request specifies the method (HTTP 



m 5 GET or HTTP POST). If the method is POST, the request also specifies the content type 
O (XML or URL encoded). The request URL specifies the target resource file (DADX or DTD) 
and the action (an operation defined in the DADX file or a command which is one of: 
SOAP, TEST, XSD, WSDL, WSDLservice, or WSDLbinding.) For example, the URL 
"http://services.myco.com/sales/PartOrders.dadx/TEST" specifies the DADX resource file 
20 named PartOrders.dadx and the action TEST. If the action is an operation, the request also 
contains the input parameters. 
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Continuing the description of the logic, at block 34, the resource (DADX or DTD) 
specified by the request is loaded. Next, at decision diamond 36, it is determined whether the 
method is POST or GET. If the method is POST, the logic moves to decision diamond 38 
where it is determined whether the POST request is XML or URL encoded. If the POST 
5 request is XML, the logic moves to decision diamond 40 where it is determined whether the 
action is SOAP. If the action is SOAP, the logic continues to block 42 where the SQL 
parameters are replaced with values from the request. From block 42, the logic proceeds to 
decision diamond 44 where it is determined whether DAD is used. DAD might be used if the 
request requires storing or retrieving XML. If so, the DAD is loaded at block 46. Thereafter, 

□0 at block 48, the SQL statement is sent to the database. Proceeding to block 50, the SQL 
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0- result is formatted as XML. It is to be understood that the SQL result can be formatted as 
SI 

^ XML at the database or at the router. At block 52, an XML response is returned to the 

{f\ service requestor. The logic then ends at state 54. At decision diamond 44, if DAD is not 

O used, the logic moves to block 48 and continues as described above. 

in 

y!5 Returning to decision diamond 40, if the action is not SOAP, the logic moves to block 
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3 56 where an error response is returned to the service requestor. The logic then ends at state 
54. If, at decision diamond 38, the POST request is URL encoded, the logic proceeds to 
decision diamond 58 where it is determined whether the action is an operation. Similarly, at 
decision diamond 36, if the method is GET, the logic also proceeds to decision diamond 58. 
20 At decision diamond 58, if the action is an operation, the logic continues to block 42 and 
continues as described above. On the other hand, if the action is not an operation, the logic 
proceeds to decision diamond 60 where it is determined whether the action is XSD. If so, an 
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XSD response is generated, at block 62, from the resource specified by the request. For 
example, if the resource is a DTD, it is converted to XSD, or if the resource is DADX, the 
XSD is generated for the message types used by the resource's WSDL interface. Regardless, 
after the XSD is generated, the logic proceeds to block 52 where the response is returned to 
the service requester. The logic then ends at state 54. 

At decision diamond 60, if the action is not XSD, the logic proceeds to decision 
diamond 64 where it is determined whether the action is TEST. If so, at block 66, an HTML 
documentation and test page response are generated. Thereafter, at block 52, this response is 
returned to the service requestor. The logic then ends at state 54. 

Returning to decision diamond 64, if the action is not TEST, the logic continues to 
decision diamond 68 where it is determined whether the action is one of: WSDL, 
WSDLservice, or WSDLbinding. If so, the logic moves to block 70 where a WSDL response 
is generated that includes the appropriate content. For example, if the action is WSDL, a self- 
contained WSDL document is generated. Or, if, e.g., the action is WSDLservice, only the 
WSDL service elements are generated. Furthermore, if, e.g., the action is WSDLbinding, the 
WSDL binding elements and any elements that they reference are generated. It can be 
appreciated that the WSDLservice and WSDLbinding actions are useful for working with 
Universal Description, Discovery and Integration (UDDI) registries. After the WSDL 
response is generated, the logic moves to block 52, where the response is returned to the 
service requestor. The logic then ends at state 54. If, at decision diamond 68, the action is 
not WSDL, the request is invalid and an error response is returned to the requestor. Then, the 
logic ends at state 54. 



Referring now to Figure 3 a non-limiting, exemplary DADX file related to a parts 
database is shown. This particular DADX file implements three operations: "findAll" 70, 
"findByColor" 72, and "findByMinPrice" 74. All three operations are implemented using the 
stored procedure DXXGenXML, which works with the XML collection method. It is to be 
appreciated that each operation may use a different implementation and access method. As 
shown in Figure 3, the DADX file refers to the same DAD, "getstart_xcollection.dad" 76, for 
each operation. Moreover, each operation specifies an SQL override 78, 80, 82 which 
replaces the parameterized SQL statement 84, 86, 88 defined in the DAD file. Although the 
SQL statement is overridden, the new SQL statement produces a data set that is compatible 
with the SQL mapping defined in the DAD file. For example, the column names that appear 
in the DAD file must also appear in the SQL override. 

Figure 3 also shows that the WHERE clauses of the SQL statements within each 
operation are modified to include search conditions. For example, within the "findByColor" 
operation 72, the WHERE clause has been modified as follows: "where p.order_key = 
o.orderjcey and s.part_key = p.partjcey and color = xolor." Within the "findByMinPrice" 
operation, the WHERE clause has been modified to include "p.price>= :minprice." The 
Dxxlnvoker extracts the parameters from the request, parses and validates them, converts them 
to SQL syntax and substitutes them for the host variables in the SQL override. 

Figure 4 on the other hand sljows the general schema for any DADX file. Figure 4 
shows a complete set of operations supported by DADX. It is to be understood that the XML 
collection operations use DXX stored procedures. Moreover, it is to be understood that the 
SQL operations use normal SQL SELECT, UPDATE, INSERT, DELETE, and CALL 
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statements, and can be used for XML column operations by employing the DXX UDTs and 
UDFs. When using the SQL operations, parameters can be defined using XSD elements as 
well as simple types. For the query operation, XSD elements can be associated with column 
values in the result set. For CALL operations, the parameters can be declared as IN, OUT, or 
IN/OUT. 

The XML collection operations include retrieve XML and store XML. Figure 4 shows 
the definitions of these operations. The other SQL operations include query, update, and call. 
The definitions and specifications that refer to these operations include: <element 
ref="dadx:query"/> 104, <element ref="dadx:update7> 106, and <element ref="dadx:caH7> 
108. These statements have the format: <element ref= "dadx:parameter" minOccurs="0" 
maxOccurs="unbounded'7> 110. Specifically, these statements can occur after the 
"SQL_override" statement 112, after the "SQL_query" statement 114, after the "SQL_update" 
statement 116, and after the "SQL_caH" statement 118. During operation, the parameter 
values are provided by the client. If necessary, the results corresponding to the search term 
can be processed at the router by the DXX Invoker 1 8 so that they can be transmitted through 
the firewall back to the client. 

While the particular SYSTEM, METHOD, AND COMPUTER PROGRAM 
PRODUCT FOR REFORMATTING NON-XML DATA FOR USE WITH INTERNET 
BASED SYSTEMS as herein shown and described in detail is fully capable of attaining the 
above-described objects of the invention, it is to be understood that it is the presently 
preferred embodiment of the present invention and thus, is representative of the subject matter 
which is broadly contemplated by the present invention, that the scope of the present 
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invention fully encompasses other embodiments which may become obvious to those skilled 
in the art, and that the scope of the present invention is accordingly to be limited by nothing 
other than the appended claims, in which reference to an element in the singular is not 
intended to mean "one and only one" unless explicitly so stated, but rather "one or more." All 
structural and functional equivalents to the elements of the above-described preferred 
embodiment that are known or later come to be known to those of ordinary skill in the art are 
expressly incorporated herein by reference and are intended to be encompassed by the present 
claims. Moreover, it is not necessary for a device or method to address each and every 
problem sought to be solved by the present invention, for it is to be encompassed by the 
present claims. Furthermore, no element, component, or method step in the present disclosure 
is intended to be dedicated to the public regardless of whether the element, component, or 
method step is explicitly recited in the claims. No claim element herein is to be construed 
under the provisions of 35 U.S.C. section 112, sixth paragraph, unless the element is expressly 
recited using the phrase "means for." 
WE CLAIM: 
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